2 research outputs found
Squeeze-and-Excitation SqueezeNext: An Efficient DNN for Hardware Deployment
Indiana University-Purdue University Indianapolis (IUPUI)Convolution neural network is being used in field of autonomous driving vehicles or driver assistance systems (ADAS), and has achieved great success. Before the convolution neural network, traditional machine learning algorithms helped the driver assistance systems. Currently, there is a great exploration being done in architectures like MobileNet, SqueezeNext & SqueezeNet. It improved the CNN architectures and made it more suitable to implement on real-time embedded systems.
This thesis proposes an efficient and a compact CNN to ameliorate the performance of existing CNN architectures. The intuition behind this proposed architecture is to supplant convolution layers with a more sophisticated block module and to develop a compact architecture with a competitive accuracy. Further, explores the bottleneck module and squeezenext basic block structure. The state-of-the-art squeezenext baseline architecture is used as a foundation to recreate and propose a high performance squeezenext architecture. The proposed architecture is further trained on the CIFAR-10 dataset from scratch. All the training and testing results are visualized with live loss and accuracy graphs. Focus of this thesis is to make an adaptable and a flexible model for efficient CNN performance which can perform better with the minimum tradeoff between model accuracy, size, and speed. Having a model size of 0.595MB along with accuracy of 92.60% and with a satisfactory training and validating speed of 9 seconds, this model can be deployed on real-time autonomous system platform such as Bluebox 2.0 by NXP
OTAdapt: Optimal Transport-based Approach For Unsupervised Domain Adaptation
Unsupervised domain adaptation is one of the challenging problems in computer
vision. This paper presents a novel approach to unsupervised domain adaptations
based on the optimal transport-based distance. Our approach allows aligning
target and source domains without the requirement of meaningful metrics across
domains. In addition, the proposal can associate the correct mapping between
source and target domains and guarantee a constraint of topology between source
and target domains. The proposed method is evaluated on different datasets in
various problems, i.e. (i) digit recognition on MNIST, MNIST-M, USPS datasets,
(ii) Object recognition on Amazon, Webcam, DSLR, and VisDA datasets, (iii)
Insect Recognition on the IP102 dataset. The experimental results show that our
proposed method consistently improves performance accuracy. Also, our framework
could be incorporated with any other CNN frameworks within an end-to-end deep
network design for recognition problems to improve their performance.Comment: Accepted to ICPR 202